Local Minima-Based Exploration for Off-Lattice Protein Folding
نویسندگان
چکیده
We present a new and simple algorithmic approach to help predict protein structures from amino acid sequences based on energy minimization. In the search for the minimal energy conformation, we analyze and exploit the protein structures found at the various local minima to direct the search the global minimum. As such, we explore the energy landscape efficiently by considering only the space of local minima instead of the whole feasible space of conformations. Our specific algorithmic approach is comprised of two different elements: local minimization and operators from genetic algorithms. Unlike existing hybrid approaches where the local optimization is used to fine-tune the solutions, we focus primarily on the local optimization and employ stochastic sampling through genetic operators for diversification. Our empirical results indicate that each local minimum is representative of the substructures contained in the set of solutions surrounding the local minima. We applied our approach to determining the minimal energy conformation of proteins from the Protein Data Bank (PDB) using the CHARMM and UNRES energy model. We compared against standard genetic algorithms and Monte Carlo approaches as well as the conformations found in the PDB as the baseline. In all cases, our new approach computed the lowest energy conformation. Proteins have specific native folding structures, but without proper folding, these proteins function incorrectly and lead to disease. The major challenge lies in effectively predicting such native conformations. Particularily we employ an iterative algorithm in searching for the global minimum; begin with an initial pool of local minima; construct a new pool of solutions by combining the various building blocks found in the original pool; map all combined solutions to their representative local minima; and, repeat the process. (For specific details on our algorithm, see [4][5].) Our procedure seems to share a great deal of commonality with evolutionary computing techniques because we employ genetic operators for diversification. However, most hybrid evolutionary computing algorithms used local minimization algorithms as the solutions of “fine-tune”, we concentrate primarily on constructing local minima from previously explored minima and only use genetic operators to assist our seacrh. For our baseline comparisons, we referenced the structure from the Protein Data Bank (PDB) for each protein sequence. These structures represent the predicted native structures for each protein known to date. Furthermore, because these structures were determined using various molecule models, their energy values are not directly related to CHARMM[1] or UNRES[3] energy function. Therefore, we computed the UNRES energy value from the given PDB structure and further minimized it using our gradient descent algorithm[2] for UNRES. For CHARMM, we computed the CHARMM energy value from the given PDB structure and further minimized it using our gradient descent algorithm[2] for CHARMM. Given the debates regarding the efficacy of different energy models such as CHARMM and UNRES in accurately predicting the native structure of a protein, our focus for this testbed is to determine the effectiveness and efficiency of our algorithmic approach for CHARMM and UNRES and ultimately any other realistic energy model. Hence, assuming that the molecular energy model adequately captures the necessary physical properties and real-world constraints, the energy value itself is our primary measurement criteria. We picked up several proteins arbitrarily from the Protein Data Bank (PDB) and minimized using the CHARMM and UNRES model. We compared against Standard Genetic Algorithms(SGA) and Metropolis Monte Carlo with Simulated Annealing(MC/SA) approaches. In all cases, our new approach computed the lowest energy conformation. Given the prohibitive amount of time to conduct multiple runs of each method over all 100 proteins, each method was run Proceedings of the Computational Systems Bioinformatics (CSB’03) 0-7695-2000-6/03 $17.00 © 2003 IEEE exactly once using the parameter settings determined from pre-trial runs. Hence, the weaknesses and strengths of each method is averaged over the testbed. With our experimental results our total number of iterations/generations were demonstrated (empirically) to be quite low ( ) whereas Standard Genetic Algorithms and Monte Carlo with Simulated Annealing are very high number of generations in order to provide sufficient opportunity for these methods to achieve their best solution. The computation time of LMBE for UNRES varied from 10 mins to 13 hrs, depending on the protein length, amino acid sequence and the genetic parameters (i.e. crossover rate, mutation rate). For MC/SA, the time was between 21 mins and 16 hours. Table 1 shows the obtained energy values of LMBE compared with SGA, MC/SA, and the baseline from PDB (due to space limitations, we have only presented the first 50 proteins from our experiments) . For all 100 proteins, LMBE computed the best energy conformation with the UNRES model. Moreover, it is interesting to observe that the improvement using LMBE seems to improve significantly for longer proteins on comparison to the existing baseline. Furthermore, our results on the CHARMM energy model have been equally promising. Finally, our goal is to ultimately obtain a better understanding of the computational requirements and complexities posed by different molecular energy models. With such an understanding, we can provide critical input during the design of future realistic energy models that avoid unnecessary algorithmic complexities with a guide towards efficiency for the computational biologist.
منابع مشابه
Local Minima-Based Exploration for O -Lattice Protein Folding
Proteins have speci c native folding structures, but without proper folding, these proteins function incorrectly and lead to disease. The major challenge lies in e ectively predicting such native conformations. We present a new and simple algorithmic approach to help predict protein structures from amino acid sequences based on energy minimization. In the search for the minimal energy conformat...
متن کاملCoarse grained description of protein folding
We consider twoand three-dimensional lattice models of proteins that were characterized previously. We coarse grain their folding dynamics by reducing it to transitions between effective states. We consider two methods of selection of the effective states. The first method is based on the steepest descent mapping of states to underlying local energy minima and the other involves an additional p...
متن کاملLocal interactions and protein folding: A three-dimensional off-lattice approach
The thermodynamic behavior of a three-dimensional off-lattice model for protein folding is probed. The model has only two types of residues, hydrophobic and hydrophilic. In absence of local interactions, native structure formation does not occur for the temperatures considered. By including sequence independent local interactions, which qualitatively reproduce local properties of functional pro...
متن کاملCoarse grained description of the protein folding
We consider twoand three-dimensional lattice models of proteins which were characterized previously. We coarse grain their folding dynamics by reducing it to transitions between effective states. We consider two methods of selection of the effective states. The first method is based on the steepest descent mapping of states to underlying local energy minima and the other involves an additional ...
متن کاملOptimization of Potential-Energy Parameters for Folding of Several Proteins
We introduce a novel approach to the study of the folding of proteins whose native structures are already known. We use an off-lattice atomistic potential energy. The parameters of the potential energy are simultaneously optimized for several proteins. The low-lying local-energy minima for these proteins are found by conformational space annealing. The parameters are modified in such a way that...
متن کاملLocal Interactions and Protein Folding: A 3D Off-Lattice Approach
The thermodynamic behavior of a three-dimensional off-lattice model for protein folding is probed. The model has only two types of residues, hydrophobic and hydrophilic. In absence of local interactions, native structure formation does not occur for the temperatures considered. By including sequence independent local interactions, which qualitatively reproduce local properties of functional pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003